Mechanistic interpretability

Interpretability

To understand the specific, step-by-step computational mechanisms inside the model. What is the “algorithm”?

Aims to identify and map Neuronal circuits.

See Olah2020zoom for the initial take by Chris Olah.

People